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1.0  BACKGROUND 


The  ASP  arctangent  algorithm  Is  used  In  the  computation  of  DIFAR  bearings 
and  also  In  applications  which  require  c.iculatlon  of  complex  phase  angles. 

The  algorithm  currently  available  has  been  Implemented  In  AP  microcode  ano  is 
fully  described  in  reference  (a)  of  Section  2.0.  The  microprogram  name  Is 
"8C".  The  current  algorithm  Is  also  described  in  Section  3.0  of  this  report. 

The  throughput  rate  of  the  "BC"  microprogram  Is  low  because  the  current 
algorithm  cannot  fully  exploit  the  pipeline  features  of  the  AP.  The  new  algo¬ 
rithm  provides  a  2:1  throughput  gain  primarily  because  of  Its  suitability  for 
pipelining.  This  algorithm  was  brought  to  the  author's  attention  by 
Mr.  Morris  Plotkln  (Code  503)  of  the  Naval  Air  Development  Center. 

2.0  APPLICABLE  DOCUMENTS 


a.  CDRL  No.  A055, 

IBM  6259764, 

Revision  through  2.5.2 


W2140  Computer  Mircoprogram  Design 
Document,  Arithmetic  Processor 
(APMDD),  "BC"  Microprogram. 


b.  CDRL  No.  A008, 
IBM  Corporation, 
30  June  1977 


Advanced  Signal  Processor  Service 
Test  Model  ASP  Analyzer  Unit 
Principles  of  Operation,  Volume  1. 
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3.0  DESCRIPTION  OF  CURRENT  ALGORITHM 

The  current  algorithm  nay  be  expressed  as  follows: 

9  ■  «B  ♦/-  tan-1  ( | x ! / | y | )  (1) 

where  x  ■  East/West  Amplitude 
y  ■  North/South  Anplitude 

9g-  Quadrant  Base  Angle 

+/-  a  for  quadrants  1  and  3 

for  quadrants  2  and  4 

The  algorithm  executes  In  the  AP  of  the  ASP  AU.  The  AP  is  a  pipelined,  micro- 
programmable  unit  which  has  been  designed  to  produce  high  throughput  for  sig¬ 
nal  processing  algorithms.  Tc  minimize  hardware  cost,  the  AP  has  been  provi¬ 
ded  with  limited  decision-making  capability.  It  also  lacks  both  hardware  di¬ 
vide  and  inverse  tangent  functions,  since  common  signal  processing  algorithms 
such  as  Digital  Filters  and  FFT's  do  not  require  them. 

In  the  current  Implementation  of  the  arctangent  algorithm,  the  term 
I x | / | y |  Is  calculated  via  Iterative  loop  which  yields  one  additional  bit  of 
precision  per  pass.  The  loop  employs  add/subtract  operations  only.  The  in¬ 
verse  tangent  Is  approximated  with  2  degree  precision  via  an  Inverse  table 
look-up,  Implemented  by  means  of  a  6-pass  binary  search.  Both  the  divide  and 
Inverse  table  look-up  operations  are  extremely  Inefficient. 

The  alternate  Implementation  to  be  proposed  provides  a  2. 1  gain  In  speed 
by  reducing  the  precision  required  for  the  divide  and  approximating  the  In¬ 
verse  tangent  with  a  simple  polynomial.  It  also  increases  the  precision  of 
the  arctangent  from  2  degrees  to  1  degree. 
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4.0  THEORETICAL  BASIS  FOR  THE  NEW  ALGORITHM 

The  new  algorithm  is  based  on  the  following  trigonometric  Identity: 

t an-1 ( j x 1 / 1 y | )  a  45°  +  tan"1 ( r)  (2) 

where  r  =  ( |x|-|y|)/(|x|+|y| ) 

This  transformation  offers  two  computational  advantages  for  the  AP: 

a.  | r |  requires  a  dynamic  range  of  only  0  to  1,  whereas  |x|/|y|  re¬ 
quires  a  range  of  0  to  Infinity.  As  a  result,  7  significant  bits  In 
r  yield  the  same  precision  as  14  significant  bits  In  |x|/|y|.  Since 
the  calculation  time  Is  proportional  to  the  number  of  significant 
bits  required,  r  can  be  calculated  twice  as  fast  as  | x | / |y | . 

b.  For  | r |  <  l,  tan**(r)  can  be  approximated  to  1  degree  precision  by 
the  polynoml  al : 

56.31213i'  -  11.61213r3  (3) 

This  polynomial  can  be  evaluated  in  half  the  time  required  to  perform  the 
binary  search  currently  employed  to  achieve  2  degree  accuracy,  primarily  be¬ 
cause  no  logical  decisions  are  required.  i 

I 

The  computational  disadvantages,  enumerated  below,  are  small 
compared  to  the  advantages: 

a.  r  ■  ( | x | -» | y | )/( |x|  +  |y| )  Is  more  difficult  to  evaluate  than  |x|/|y|. 
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b.  Since  r  may  be  positive  or  negative  and  the  division  technique  re¬ 
quires  that  both  numerator  and  denominator  be  positive,  |r|  must  be 
calculated  and  then  complemented  as  a  separate  step  when  |y|  >  |x|. 
The  division  technique  Is  described  in  Section  D.8  of  reference  (b). 

5.0  PROPOSED  IMPLEMENTATION 

The  arctangent  algorithm  expressed  by  Equation  (2)  shall  be  approximated 
to  1  degree  accuracy  by: 

e  -  Iob"  ♦  56.31213r  -  11.61213r3|  (4) 

where  r  ■  (|x|-|y|)/(|x|+|y|) 

and  03"  ■  quadrant  base  angle  provided  In  the  Table  below 

x  ■  East/West  Amplitude  or  Imaginary  part 
y  ■  North/South  Amplitude  or  real  part 


Quadrant  | 

I 

X 

1  y 

1  «b" 

|eB' (Hex) | 

1  1 

1  | 

x>0 

y>o 

|  +45° 

!  1 
|  002D  | 

2  1 

x>0 

y<0 

|  -135° 

1  FF79  1 

3  1 

x<0 

y<0 

|  +225° 

1  ooei  | 

4  | 

x<0 

y>0 

|  -315° 

|  FEC5  | 

Table  1  Quadrant  Base  Angles 
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The  quantity  "r"  shall  be  obtained  ty  calculating  |r|  via  the  iterative  method 
described  in  Section  D. 8  of  reference  (b)  and  then  correcting  the  algebraic 
sign  as  required.  The  45  degree  angle  increment  arising  from  the  transforma¬ 
tion,  tan'1  (|x|/jy|)  *  45°  +  tan'^r),  has  been  incorporated  into  the 
quadrant  base.  The  quadrant-dependent  +/-sign  in  equation  1  has  been  handled 
by  complementing  the  quadrant  base  for  quadrants  2  and  4  and  then  taking  the 
absolute  value  of  the  final  result.  This  procedure  is  more  efficient  for  com¬ 
putation  since  it  eliminates  the  need  for  logical  decisions, 

6.0  SIZING  AND  TIMING 

The  microcode  required  for  implementation  of  the  above  has  been  roughly 
written  by  the  author  in  order  to  demonstrate  feasibility  and  obtain  sizing 
and  timing  estimates.  The  following  results  were  obtained: 

Sizing:  66  microinstructions  -  new  implementation 

60  ini croi ns t ructions  -  current  Implementation 

Timing:  2.2  +  3.4N  usee  -  new  implementation 

26.0  +  7. 1  N  usee  -  current  implementation 

where  N  =  number  of  arctangents  to  be  calculated 

The  sizing  figures  Indicate  that  the  amount  of  microstorage  required  is  nearly 
comparable  between  the  two  implementations.  With  regard  to  Working  Storage 
utilization,  the  48  word  tangent  table,  used  cur,"ently  in  conjunction  with  the 
binary  search  for  Inverse  tangent  approximation,  is  no  longer  needed.  The  ad¬ 
ditional  Working  Storage  available  under  the  new  implementation  may  be  used  to 
process  larger  input  data  sets. 
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The  above  timing  figures  Indicate  that  the  execution  time  is  more  than 
halved  for  the  new  implementation.  An  additional  timing  advantage  results 
from  the  fact  that  two  AE's  may  now  be  used  to  process  a  pair  of  input  data 
sets  in  parallel.  Two  AE's  may  not  be  utilized  with  the  current  implementa¬ 
tion  because  the  binary  search  requires  data  -  dependent  branching.  Since 
both  AE's  must  exea  te  the  sane  microinstruction  at  all  times  (there  is  only 
one  AE  Controller),  data-dependent  branching  is  not  possible. 

7.0  SCALING  CONSIDERATIONS 

Thje  multiple  constants  56.31213  and  -11.61213  must  be  expressed  as  16-bit 
fixed-point  values.  For  maximum  precision,  a  scaling  index,  SI*7,  shall  be 
used.  SI»7  indicates  that  the  decimal  point  is  placed  after  the  seventh  bit. 
counting  left  to  right  in  the  16-bit  word.  With  this  scaling: 

I 

56.31213  shall  be  expressed  as  28,832,  base  10,  which  equals  hex  70A0. 

-11.61213  shall  be  expressed  as  -5945,  base  10,  which  equals  hex  E8C7. 

Thje  integer  values  are  obtained  ty  multiplying  each  constant  by  512  and 
roundirjg.  The  negative  constant  is  expressed  in  hexadecimal  as  a  two's  com- 

j 

plement. 

The  ratio,  r,  shall  be  calculated  by  the  microprogram  with  SI«1  and  7 
significant  bits  (plus  sign).  The  low  eight  bits  will  always  be  zero.  The 
archtangent  shall  be  expressed  as  a  pure  integer,  that  is,  with  S I =16. 
Therefore,  to  achieve  proper  scaling,  equation  (4)  must  be  implemented  as 
fol lows: 

e  *  eB'  +  j (56. 3121 3r/4  -  11. 61213r3)/64|  (5) 
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The  term  56.3l213r  has  SI  *  7  +  1  =  8.  The  term  -  11.61213  has 
SI  *  7  +  1  +  1  +  1  *  10.  Division  of  56.31213r  by  4  changes  the  SI  from  8  to 
10  so  that  both  terms  will  be  scaled  equivalently  before  addition.  Division 
of  the  sum  by  64  thanges  the  resultant  SI  from  10  to  16,  as  required.  The  hex 
values  of  9b*  the  ^ast  coluiiin  of  Table  1  are  expressed  with  SI  ■  16  so  no 
scaling  of  eB*  required. 

The  divisions  by  4  and  64  may  be  accomplished  via  AP  harckare  through  the 
use  of  a  scale  factor  register  (refer  to  reference  (b)).  The  division  by  4 
may  be  achieved  via  a  2-blt  pre-scaler  right-shift.  The  division  by  64  may  be 
achieved  via  a  6-bit  post-scaler  right-shift. 

8.0  IMPACT  OF  THE  NEW  ALGORITHM  ON  EXISTING  OPERATIONAL  SOFTWARE 

If  a  microprogram  employing  the  new  Implementation  were  coded  and  used  in 
place  of  the  current  ml croprogram,  the  Improvement  In  performance  for  two 
typical  operational  software  packages  Is  described  below: 

a.  P-3C  AU  Operational  Software/D  IF AR  Processing  Option:  Approximately 
7.5  ms  per  PIC  would  be  saved  each  time  ALI  output  was  requested. 
This  savings  would  not  significantly  reduce  total  loading  require¬ 
ments.  It  would,  however,  tend  to  offset  the  additional  load  re¬ 
quired  to  Implement  double-precision  Integration  of  the  directional 
channels,  required  to  solve  the  current  DIFAR  bearing  accuracy 
problem  In  the  presence  of  directional  noise. 

b.  Proposed  AU  Operational  Software  for  ERAPS:  Approximately  1.7%  of 
total  throughput  capacity  would  be  saved  per  sensor  during  operation 
in  the  ERAPS  bearing  resolve  mode.  This  saving,  although  non- 
negligible,  would  not  significantly  reduce  total  loading  require¬ 
ments. 
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9.0  CONCLUSION 

The  proposed  implementation  of  the  Bearing  Calculation  algorithm  for  the 
ASP  Analyzer  Unit  can  improve  effective  execution  time  by  more  than  4:1  with¬ 
out  a  significant  Increase  In  microstorage  requirements.  Its  effect  on  the 
total  loading  of  current  and  planned  operational  software  for  the  AU,  however, 
is  small.  Nevertheless,  It  Is  recommended  that  a  new  microprogram  be  devel¬ 
oped  for  future  use.  Its  effect  In  reducing  total  loading  could  be  signifi¬ 
cant  for  applications  which  make  more  extensive  use  of  DIFAR  bearing  or  com- 
olex  phase  calculatirns.  The  phase  of  any  complex  quantity  may  be  calculated 
via  the  same  algorithm. 
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